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I.  INTRODUCTION 

The  development  cf  computer  and  information  processing 
has  come  to  the  stage  of  being  able  to  handle  Korean  and 
Chinese  character  input  and  output.  There  is  no  problem  in 
information  systems  for  the  input  and  output  of  characters 
from  a  standard  Roman  character  keyboard,  but  the  problems 
related  to  non-Roman  characters  from  I/O  to  software  prob- 
lems of  language  handling  remain  almost  unsolved.  Until 
recently  the  computer  could  not  handle  Korean  or  Chinese 
characters  efficiently.  It  was  not  user  friendly  and  data 
processing  in  Korea  was  imperfect  and  very  unwieldy.  Among 
the  problems,  the  biggest  issue  is  how  to  enter  2,369  Korean 
and  1,800  common  Chinese  characters  from  the  standard  Roman 
character  keyboard. 

During  the  last  few  years,  there  have  been  great  efforts 
at  universities,  research  institutes  and  manufacturers  for 
the  development  of  good  I/O  devices  for  Korean  characters. 
In  Korea,  natural  language  processing,  .  especially  Korean 
language  processing,  is  one  of  the  essential  elements  for 
the  future  of  computer  and  information  systems. 

lirst  the  properties  of  Korean  and  Chinese  characters 
will  be  presented  as  an  introduction  for  those  unfamiliar 
with  these  characters.  Then,  the  resolution  power  of  CRT's 
and  dot  matrix  printers  and  their  relation  to  the  shape 
characteristics  (readability,  asthetic  quality,  etc.)  of 
Korean  and  Chinese  characters  will  be  discussed.  The 
methods  which  are  developed  for  Korean  and  Chinese  character 
I/O  can  be  applied  to  other  character  sets,  especially  to 
many  ncn-Roman  alphabetic  character  sets,  not  to  mention 
Chinese  characters  in  China. 


II.  3ACKGE00ND 

A.  PROPERTIES  OF  KOREAN  DOCUMENTS 

Common  documents  in  Korea  are  usually  written  in  a  nixed 
form  utilizing  Korean  and  Chinese  characters.  Minor  use  is 
made  cf  Reman  script.  The  usage  of  each  character  set 
depends  on  the  kind  cf  document.  In  order  to  perform  word 
processing  efficiently  in  Korea,  the  simultaneous  editing  of 
these  characters  is  essential.  Table  I  shows  the  use  of 
characters  found  for  various  types  of  documents.  This  data 
is  based  on  sampling  performed  expressly  for  this  study.  The 
following  sources  were  in  the  sampling  process  to  construct 
Table  I: 

1.  newspaper   -   Korean   Daily   Times,    "3A   Era",    16 
September  1984 

2.  journal  -  "National  Security",  June  1984 

3.  technical  papers  (A)   -   "COBOL  Programming",   Cong-A 
publishing  Co.,  1978 

4.  technical  papers  (B)   -   "Introduction  to  Law",   Beob 
Moon  Sa  publishing  Co.,  1978 

5.  business  papers  -  Korean  Air  Lines  Co. 

Although  the  sample  was  taken  from  a  single  source  for  each 
kind  of  document,  it  is  the  authors'  view  that  the  documents 
selected  are  representative  of  the  entire  population  of  each 
type. 

B.  CHARACTERISTICS  CF  KOREAN  SCRIPT 

The  native  Korean  alphabet  was  introduced  in  1446,  after 
centuries  of  the  use  of  a  more  cumbersome  method  (known  as 
IDU)   to  transcribe  Korean  with  Chinese  characters.   The  set 
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TABLE  I 
Proportions  of  Written  Characters 


News- 
paper 

I 

Journal 

I 

Technical 
(A)   | 

paper 
(B) 

I  Business 
paper 

Roman 
script 

1% 

I 
I 

3% 

I 

I 

40% 

0% 

I    10% 

Korean 
script 

8  4% 

I 
I 

7  6% 

I 
I 

55%    | 

5  5% 

80% 

Chinese 
character 

15% 

I 

21% 

| 

5%    | 

45% 

i    10% 

® 


Technical  Dapers  from  western  countries 
Traditional  and  historical  papers 


of  28  letters1  (now  24  letters)  was  designed  by  a  group  of 
scholars  commissioned  by  King  Sejong  (1419  -  1450),  the 
fourth  King  of  the  Yi  dynasty. 

The  Korean  language  and  alphabet  is  spoken  and  written 
by  an  estimated  50  million  people  on  the  Korean  peninsula 
and  its  coastal  islands.  Many  among  the  approximately  one 
million  Koreans  residing  in  Japan,  China,  and  America  still 
speak  and  write  the  language  [Ref.  9]. 

The  Korean  alphabet  currently  used  consists  of  14  conso- 
nants (jt_  _l-_  _c_  j2_  a_  _d_  _A_  _p_  JL  JL  JL  3-  3L  JL)  an^  10 
vowels  (_£_  Jijt.  rL^-JkJLJL^Z.  _D  •  There  are  also  17 
compound  consonants  (Ti  ta  a  lS  cc  jn  SQ  jiB  2A  E£  2X  2o  pn 
^M  ^A  M  XX)  and  11  compound  vowels  (H_  H_  j[_  ^j_  ±fc  ±H  ii  fL 
ffj  jj_  --{_)  .  The  letters  of  the  Korean  alphabet  cannot  be  used 
independently  but  are  used  to  build  syllables.  Each  Korean 
character  consists   of  two  or   three  parts.   The   first  part 


*A  letter  is  an  element  of  a  character.  The  character 
consists  of  two  or  three  letters.  Letters  in  Korea  are  a  set 
of  14  consonants  and  10  vowels. 
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must  be  a  consonant  or  compound  consonant.  There  are  19 
letters  that  are  possible  for  the  first  part  of  the  Korean 
character.  They  are  typically  consonants  or  compound  conso- 
nants (_x  T]  L-  c _  cc  a  o  d  dti  A  M  o  7>  XX  A  =?  £  1 
_p_)  .  The  second  part  of  the  Korea  character  is  typically  a 
vowel  or  compound  vowel. There  are  21  possible  letters  for 
the  second  part  of   the  Korean  character  (_h_  _H_  Jp_     JjL  _^_  ^[_ 

i  i  i  iL  iil  ±L  iL  Jl  Jl  Jrl'  _l!  JL  -ZI  -ri  _L)  •  The  third 
part  of  the  Korean  character  is  optional  and  depends  on  the 
character  being  depicted.  The  third  part  if  present,  must 
be  a  consonant  or  a  compound  consonant.  There  are  23 
letters  possible  as  the  third  part  [l_  n  lA  l  Ljj  L*  c  a. 
S7  HQ  Eg  HA  e§  Eo  £  D7  b)  MA  A  M  o_  *_  *.  _f7_  _£_  JE_  3_ 
g£) .  This  section  has  been  summarized  in  Figure  2.1. 

The  Korean  system  of  writing  is  called  "Hangul".  It  is 
"phonetic"  writing,  like  English,  in  the  sense  that  the 
symbols  represent  sounds,  that  is,  consonants  and  vowels. 
Unlike  English  symbols,  which  are  grouped  directly  into 
words  (e.g.,  E+n+g+1+i+s+h  =  English),  Korean  symbols  are 
first  grouped  by  syllable  (e.g.,  H+a+n  g+u+1  =  Han  gul) 
[Hef.  10]. 

Korean  symbols  are  written  in  syllabic  groupings.  An 
enumeration  method2  is  to  put  letters  side  by  side  as  in 
"LONDON".  But  the  Korean  language  stacks  the  letters  in  most 
characters.  For  example,  "LONDON"  would  be  written  _y_  n  . 
The  simplest  syllable  is  written  with  one  consonant  and  one 
vowel.  When  one  writes  the  symbol  for  a  vowel  alone,  one 
must  add  the  consonant  symbol  "_0_",  which  indicates  an 
initial  mute  (which  is  closed  as  a  consonant) .  In  this 
simple  consonant  and  vowel  syllable,  there  are  two  types  of 
arrangements;    side-by-side  arrangement   (e.g.,   7_h_)    and 


2In  an  enumeration  method  letters  are  olaced  side  by 
side  or  element  by  element  using  a  set  of  "consonants  and 
vowels. 
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*  .  THE  CHARACTERISTICS  OF  KOREAN  CHARACTER 

.  THE  KOREAN  ALPHABET  CONSISTS  OF  24  BASIC  LETTERS*  ELEMENTS )', 

14  consonants:  ucsoaAHitncDo 
10  vowels  :  h  c  i  4  i  u  t  n  _  i 

.  EACH  CONSONANT  AND  VOWEL  CAN  BE  COMPOUNDED 
.  POSSIBLE  COMPOUND  CONSONANTS 

TJ  1A  LX  LS  CC  21  20  2U  2A  2c  2D  2o  Dl  tib  tiK    AA  XX 
.  POSSIBLE  COMPOUND  VOWELS 

H   tl   it   41   it-   1H   it   Ti   HI   Tl   _l 

*.  EACH  CHARACTER  CAN  BE  DIVIDED  INTO  THREE  PARTS  <  FIRST 
SOUND,  MIDDLE  SOUND,  FINAL  SOUND)  OR  TWO  PARTS  (FIRST  AND 
SECOND  SOUND). 

.  THE  FIRST  PART  MUST  CONSIST  OF  A  CONSONANT  OR  A  COMPOUND  CONSONANT 
THE  SECOND  PART  MUST  CONSIST  OF  A  SINGLE  OR  A  COMPOUND  VOWEL 
THE  THIRD  PART  IS  OPTIONAL.  IF  USED,  IT  MUST  BE  A  CONSONANT- 
.  THE  FOLLOWING  LETTERS  CAN  BE  USED  AS  THE  FIRST  PART; 

TTJLccranatiHAAAHX-xaKiciis 

19  LETTERS 
.  THE  FOLLOWINC  LETTERS  CAN  BE  USED  AS  THE  SECOND  PART; 

I-  H  t  tl  1   -B  4  41  i  ih  1H  il  U  T  T1  T-H  Tl  n  _  _l  I 

21  LETTERS 
•    THE  FOLLOWING  LETTERS  CAN  BE  USED  AS  THE  THIRD  PART; 

1  Tl  TA  L  LX  LS  C  2  21  20  3d  2A  2c  20  23  Q  01  ti  HA  A  AA  H  X  n  1 

c     n     •     ;     28  LETTERS 
*.   NUMBER  OF  POSSIBLE  COMBINATIONS  OF  CHARACTER  =  19*23*29  =  11,571 

IN  PRACTICE,    ONLY  ABOUT  2,  400  CHARACTERS  ARE  USED. 


Figure    2.1         The    Korean    Alphabet. 
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top-to-bottom  arrangement  (e.g.,  JL_)  .   The  particular  vowel 
being  written  determines  which  arrangement  is  used. 

Eepresenting  these  character  syllables  through  a 
computer  creates  a  problem  because  each  letter's  (consonant 
and  vowel)  shape  can  be  different  due  to  a  requirement  that 
each  character  be  balanced,  i.e.,  have  the  same  size  and 
achieve  a  desired  asthetic  quality.  For  example,  when  _~J_  is 
placed  to  the  left  cf  a  vowel,  the  downward  portion-  is 
slanted:  J7_  (e.g.,  7jr_)  •  When  it  is  placed  on  top  of  the 
vowel,  the  downward  portion  becomes  straight:  _H_  (e.g., 
_H  )  .  As  shown  above,  it  is  very  difficult  to  apply  these 
different  shapes  for  a  particular  letter  to  a  line  printer 
and  a  typewriter.  This  problem  will  be  discussed  in  detail 
in  the  following  chapter. 

By  mathematical  calculation,  the  possible  number  of 
Korean  characters  is  11,571  (  19  *  21  *  29  ).  It  must  be 
noted  through  that  cnly  2,369  characters  are  commonly  used 
[Bef .  8:  p.  11  ]. 

C.   CHARACTERISTICS  OP  SINO-KOREAN  CHARACTEBS 

Sino-Korean  characters  are  Chinese  characters  used  in 
Korea.  They  are  different  from  those  used  in  China.  Koreans 
refer  to  Chinese  characters  as  Hanja.  Chinese  characters 
have  a  long  history,  the  earliest  discovered  writings  having 
been  dated  from  about  14  B.C..  In  100  A.D.  during  the  Han 
Dynasty,  this  was  modified  by  Hsu  Sheng  (sft'BL  30  -  124  )  in 
his  15  -  Volume  paleographical  work,  Shuo-wen  Chieh-tzu, 
(JS.JlM-!^)  which  translates  to  the  explanation  of  writing 
and  analysis  of  words.  That  work  lists  9,353  characters 
under  540  radical  entries.  Of  this  number,  36  4  are  picto- 
graphic,  125  simple  idiographic,  1,167  compound  idiographic 
and  7,697  phonetic  ccmpounds. 
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The  most  complete  collection,  the  Kang  Hsi  Dictionary 
with  about  50,000  characters  was  published  in  1716.  Since 
1949,  after  the  establishment  of  the  Peoples  Republic  of 
China,  the  Chinese  government  actively  pursued  language 
reform  until  the  Cultural  Revolution,  1966-1976.  The  Chinese 
government  changed  and  simplified  the  characters  from  the 
original   [Ref.    5:    p.    15]. 

The  number  of  characters  used  commonly  is  from  1,000  to 
3,000.         Table    II    [Ref.    1:       p.         819]   shows    the    frequency    of 


TABLE  II 

i 

Frequency 

of  Chinese  Characters  Used  in  Documents 

News- 

General 

Total    I  News- 

General 

papers 

Document 

Document  papers 

Document 

(*) 

[%) 

(%)   I  (chrs) 

(chrs) 

1st  10  chrs 

10.0 

8.8 

80 

499 

638 

50 

27.5 

25.  5 

85 

615 

777 

100 

38.9 

36.  1 

90 

781 

992 

200 

55.4 

51.0 

95 

10  68 

1358 

500 

79.0 

73.5 

96 

1156 

1479     j 

1000 

93.  1 

89.0 

97 

1269 

1617 

1500 

97.4 

95.0 

98    | 

1421 

1832 

2000 

98.7 

97.  6 

99 

1661 

2157 

2500 

98.9 

99.4 

100 

2879 

3323 

3000 

99.8 

i 

*    chrs : 

acronym  of  characters 

Chinese    characters    used    in    typical    documents. 

In  1972,  the  Korean  ministry  of  Education  suggested  that 
1,800  Chinese  characters  be  learned  and  used  for  educational 
purposes  [Ref.  3].  In  this  study,  the  authors  will  restrict 
themselves    to      that    set    of      1,800    characters.  The   Chinese 

characters    are    called      Hantzu    in    Chinese,       Hanja      in    Korean, 
and    Kanji      in   Japanese.       All      mean    "Han      Characters"     (<j£jtL)  • 
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These  characters  are  used  exclusively  in  Chinese  writings, 
and  in  combination  with  the  Hangul  (Korean)  alphabet  in 
Korea  and  with  the  Kana  Syllabaries  in  Japan.  The 
Sino-Korean  (Hanja) ,  in  written  form,  is  a  combination  of 
three  major  elements:  pictograms  and  ideograms,  and 
phonograms  [Eef.  5:  p.  22]. 

In  the  next  chapter  the  perspective  of  a  picture  for 
each  character  will  be  used  because  of  both  the  complexity 
of  Chinese  chracters  and  the  ease  of  representation  in  the 
computer.  Each  Chinese  character  has  the  meaning  and  sound, 
for  example,  J|_  means  heaven  and  the  sound  is  cheon.  Also, 
there  are  many  characters  which  have  different  meanings  tut 
the  same  sound,  or  the  same  meaning  but  different  sounds. 
In  order  to  solve  this  problem  there  are  several  methods. 
Appendix  A  [Ref.  5:  p.  17]  shows  the  evolution  of  Chinese 
characters. 
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III.  PROBLEMS  OF  EDITING  KOREAN  AND  CHINESE  SCRIPTS 

A.   CORPENT  EDITING  TECHNOLOGY 

The  current  word  processing  practice  in  Korea  is  to  type 
Korean  characters  by  the  enumeration  method,  that  is,  input 
letters  (8  bit  code:  consonant  and  vowel  in  sequence)  and 
output  these  letters  as  a  character  syllable  using  a  Korean 
character  conversion  program  for  Korean  script.  Appendix  B 
shows  the  EECDIC  input  codes  currently  used  by  FACOM,  and 
Appendix  C  depicts  MES  (Mahawk  Data  Sciences)  input  codes 
used  by  IBM.  To  type  Chinese  characters  the  following 
sequence  is  followed: 

1.  Depressing  a  Chinese  character  function  key. 

2.  Typing   the  sound   character  of   a  Chinese   character 
using  the  enumeration  method. 

3.  Displaying   all  homonym   (from  1   to  60)    characters 
[Ref.  4:  p.  34]  that  have  the  same  sound. 

4.  Selecting  one  character  by  using  an  index  number,  and 
entering  the  character  to  a  buffer  or  file. 

Machines  dealing  with  Korean  language  data  are  currently 
available  from  the  IBM  and  FACOM  corporations  in  Korea; 
IBM's  Multistation  5550  (1984)  and  FACOM  OS  I V( KEF)  ( 1982) 
are  newly  updated  and  well  developed  machines.  These 
machines  still  have  several  disadvantages  in  handling  Korean 
and  Chinese  characters: 

1.  A  large  amount  of  time   is  spent  in  character  conver- 
sion. 

2.  It  is  difficult  to  directly  delete  and  insert  records 
in  a  file. 

3.  The  word  processing  editor  cannot  recognize  the  char- 
acters  being   edited  before   executing   a   character 
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conversion  program  since  only   the  enumerated  letters 
can  re  displayed, 
a.   The  method  of  entering  characters  is  inconvenient  and 
requires  a   tremendous  amount   oi  effort   for  Chinese 
characters. 

5.  One  cannot  convert  all  Korean  character  syllables 
into  Chinese  characters  because  there  is  not  a  one  to 
one  mapping. 

6.  Data  communication  is  impossible  since  there  are  no 
standard  codes  for  Korean  and  Chinese  characters. 

Appendices  D  and  E  show  the  keyboard  of  IBM  Multistation 
5550  [Ref.  8:  p.  14]  and  FACCM  OS  IV  (KEF)  [ Ref .  7:  p.  48] 
respectively. 

B.   OSIR  REQUIREMENTS 

Most  potential  users  have  recognized  that  the  computer 
is  essential  in  data  processing  and  office  automation. 
However,  because  of  the  above  constraints,  they  are  unsatis- 
factory for  use  with  the  Korean  language.  Some  general  user 
requirements  of  computer  researchers  and  manufacturers  are 
the  following: 

1.  Users  want  to  use  Korean  language  commands  and 
programs  but  there  are  no  Korean  language  oriented 
operating  systems  or  programming  languages  such  as 
COECL,  FORTRAN,  Pascal,  etc. 

2.  Users  want  to  edit  three  kinds  of  characters  simulta- 
neously and  in  a  user  friendly  manner. 

3.  Users  want  to  display  and  print  out  data  without 
using  a  conversion  program,  as  is  done  with  the 
Korean  alphabet  because  of  time,  memory  space,  and 
inconvenience  . 

4.  Users  want  to  use  interactive  files  and  database 
processing. 
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In  summation,  they  want  to  use  computers  that  handle  three 
kinds  of  script  in  the  same  manner  in  which  present 
computers  do  with  the  Roman  alphabet. 

C.   PROBIEMS  OF  REPRESENTATION  OF  THE  THREE  KINDS  OF  SCRIPTS 

Because   of  the   characteristics  of   Korean  and   Chinese 
characters,  the  following  problems  occur: 

1.  Hew  can  one  enter  2,400  Korean  characters  and  1,300 
Chinese  characters  into  a  computer  through  a  limited 
numter  of  keystrokes. 

2.  How  can  one  develop  the  system  program  to  direct 
input  and  output  without  using  a  conversion  program. 

3.  How  can  the  asthetic  quality  of  display  and  output  be 
improved. 

4.  Hew  can  one  increase  the  processing  speed  and  reduce 
the  memory  space  for  these  character  definitions. 

There  are  other  problems  tut'  the  above  problems  are  the 

most  significant.   Amcng  these  problems  the  first  one  is  the 

most  serious  and  significant  problem,  and  consequently,  the 
authors  will  give  it  more  attention  in  this  study. 
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IV.  POSSIBLE  METHODS  FOR  KOREAN  LANGUAGE  DATA  PROCESSING 

In  order  to  solve  the  problems  which  were  mentioned  in 
the  previous  chapter,  the  following  methods  are  offered  as 
possible  alternatives  for  Korean  language  data  processing. 

A.   8-BIT  CODE  FOR  KOREAN  ALPHABET 

Since  the  Korean  alphabet  consists  of  only  24  letters 
and  Korean  language  data  can  be  expressed  using  only  Korean 
characters  without  a  serious  problem.  The  enumeration 
method,  like  the  Reman  alphabet,  is  the  easiest  way  to 
represent  Korean  characters  without  changing  the  hardware 
and  the  operating  system.  This  method  is  not  highly  readable 
and  would  require  changes  in  the  language  which  may  not  be 
acceptable  to  users. 

1  •   Usin£  the  Current  S tanda rd  Keyboard 

A  program  can  be  loaded  which  defines  the  24  letter 
Korean  alphabet  to  a  character  generator  instead  of  the 
lower  case  Roman  alphabet.  All  Korean  alphabet  elements  and 
the  upper  case  Roman  alphabet  characters  are  then  available 
through  the  standard  Roman  character  keyboard.  With  this 
method  the  user  can  use  a  computer  in  a  similar  manner  as 
the  users  who  use  the  Roman  alphabet.  In  addition,  well 
developed  hardware  and  software  can  be  used  without  critical 
problems.  This  method  has  been  suggested  by  many  groups  of 
people  from  the  time  when  the  Korean  typewriter  was  first 
developed.  The  only  disadvantage  is  the  breaking  cf  tradi- 
tional custom.  To  capitalize  on  developed  technology  and  for 
the  ease  of  application,  more  study  and  research  should 
center   on  user   acceptability   of   the  enumeration   method. 
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Figure  4.1  shows  an  example  of  hard  copy  which  uses  a 
graphic  dot  printer  and  a  standard  keyboard.  Appendix  F 
shows  the  load  command  program   for  an  alternative  character 


ii  sro  o_i  uia 

Lh  0-1    AJ-2  CH.  11  SCO  0.L  111*  HI    L_L  AfL  1i5  0T2  1_A  tiT3  1_A 
MA  CH    1T-B   »    0+    2IL   CIO  H  1_  Al")   03  Ai  Ah  2  CiL   HH -  >    1.  2IU 
0_d  LI    O. 


Figure  4.1    Exaaple  Osicg  Standard  Keyboard. 

generator  for  the  Korean  alphabet.  This  prograr  can  be 
generated  easily  by  the  alternative  character  set  editor, 
and  it  loads  the  Korean  alphabet  to  an  alternative  character 
generator  instead  of  the  lower  case  of  Roman  alphabet. 

2-   Using  the  Capital  Letters  is  the  Initial  Letter 

The  major  difficulty  with  the  enumeration  method  is 
poor  readability.  Kcrean  users  read  a  sentence  sequentially 
syllable  by  syllable.  In  order  to  increase  readability,  the 
initial  letter  of  each  character  can  be  written  as  an  upper 
case  letter  to  distinguish  the  syllable  easily.  Figure  4.2 
shows  the  example  using  the  capital  letters  and  Appendix  G 
represents  the  load  command  program  foe  these  letters.  A 
special  mark  or  altered  shape  of  each  letter  aLso  can  be 
applied  to  increase  a  readability  when  an  enumeration  method 
is  used. 
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Lj-Oj  ^-aCHu  "Lo>oO-l  IL.UiL-l  A-1/L2  0t2~La  thai-/, 
0-aLiCM 


Figure  4.2    Example  Using  Capital  Letters. 

B.   16-BIT  CODE  FOB  THE  THREE  KINDS  OF  SCRIPT 

There  are  various  methods  one  can  use  to  enter  Korean 
and  Chinese  characters,  but  the  16-bit  code  is  one  of  the 
tetter  methods,  since  it  can  identify  all  possible  Korean 
and  Chinese  characters  without  using  the  enumeration  method 
and  a  conversion  program.  The  structure  of  this  code  will  be 
discussed  briefly  in  the  following  subsection. 

1*   J6-bit  Code  fee  Korean  Script 

As  mentioned  tefore,  a  Korean  character  syllatle 
consists  of  three  parts: 

1.  First  sound;  cne  of  19  simple  or  double  consonants. 

2.  Second  sound;  cne  of  21  simple  or  compound  vowels. 

3.  Third  sound;   cne  of  28  simple  or  compound  consonants 
(optional) . 

Since  the  number  of  each  first,  second,  and  third 
letters  is  less  than  32  letters,  5  bits  are  enough  to  iden- 
tify each  sound.  All  possible  Korean  characters  can  be  iden- 
tified using  15  bits.  The  1st  bit  of  16  bits  is  used  to 
indicate  a  Korean  character  (by  a  0) .  The  next  5  bits  are 
used  for  the  first  sound,  the  following  5  tits  for  the 
second  sound,  and  the  final  5  bits  for  the  third  sound. 
Table  III  shows  the  structure  of   16-bit  code  for  the  Korean 
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character  and  Table  IV  explains  the  16-bit  code  for  Korean 
character.  This  code  table  is  basically  the  same  as  the  IB*! 
2-byte  internal  Korean  character  code  [Ref.  6:  p.  52].  The 
only  difference  is  the  arrangement.  Some  IBM  codes  represent 
three  letters.  This  makes  key  tops  (face  of  each  key)  more 
complex;  for  example,  00100  (one  key  top)  represents  _l_,  H  , 
and  HAv  values.  The  code  suggested  in  Table  TV  reduces  seme 
of  this  complexity  ty  limiting  the  possible  values  to  no 
more  than  two  for  each  keytop.  In  contrast  to  the  example 
for  IBM  codes,  the  same  code  from  Table  IV  represents  only 
one  value.    Appendix   H  represents  the  IBM   2-byte  internal 


TABLE  III 
Structure  of  16-bit  Code  for  Korean  Script 


1st  Byte- 


>l< 


2nd  Byte- 


> 


I   I   I 


< — 1st  sound — X 2nd  sound  —  X 

5  bits     I     5  bits 


I   I   I 


3rd  sound--> 
5  bits 


*  0:  Korean  character 


Hangul  code  for  the  Korean  character. 

The  suggested  code  has  several  advantages.  First,  it 
is  easy  to  sort  the  character  order  by  its  value  since  the 
value  of  each  letter  is  in  the  order  of  the  Korean  alphabet. 
Second,  it  can  reduce  the  memory  space  for  data  by  using  2 
bytes  instead  of  3  bytes  for  one  character.  Third,  it  is 
possible  to  edit  the  character  directly  since  it  does  not 
need   code   conversion.     Finally,    since   it   can   easily 
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recognize  the  code  value  of  the  Korean  character,  it  helps  a 
programmer  when  it  is  programmed. 




TABLE    IV 
16-bit  Code   for   Korean   Script 

5    bit 
code 

1st 
sound 
letter 

2nd 
sound 
letter 

3rd 
sound 
letter 

5    bit 
code 

1st 

sound 
letter 

2nd 
sound 
letter 

3rd 
sound 
letter 

ooooc 

10000 

^~ 

Eo 

00001 

"7 

"7 

10001 

a 

a 

00010 

11 

h 

17 

10010 

iL 

01 

00011 

H 

TA 

10  011 

d 

u 

00100 

L 

L. 

10100 

Bd 

T 

00101 

1= 

LX 

10101 

T| 

UA 

00110 

H 

LS 

10110 

A 

A 

00111 

c 

c 

1011  1 

AA 

*l 

AA 

01000 

CL 

H 

1  1000 

O 

O 

01001 

d. 

2 

11001 

X 

X 

01010 

H| 

Zl 

11010 

77 

rl 

01011 

=1 

ED 

1101  1 

* 

7T 

A 

01100 

=11 

2a 

1  1100 

^ 

— 

=7 

01101 

J- 

2A 

11101 

E 

H 

E 

011  10 

A- 

2£ 

11110 

—£" 

l 

X 

01111 

M\ 

£1 

11111 

J- 
o 

o 

... 

*    Blank:    Not    used 
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2  •   16-bit  Code  for  Chinese  Characters 

There  is  no  limitation  in  the  number  of  usable 
Chinese  characters,  but  statistics  show  that  1,800-3,000 
characters  cover  98-99.8  percent  of  those  which  appear  in 
newspapers  and  journals  (Table  II).  Currently  there  are 
only  two  ways  to  represent  Chinese  characters  in  Korea.  One 
method  is  comprised  cf  two  steps.  The  first  step  is  to 
display  all  Chinese  characters  (synonym)  which  have  the  same 
sound  after  entering  the  desired  sound,  and  the  second  step 
is  to  enter  the  Chinese  character  which  is  needed  by  the 
user  via  an  index  number  matched  to  that  character  after 
selecting  it  in  the  display.  The  other  method  is  to  convert 
a  Korean  character  to  a  Chinese  character  after  typing  a 
Korean  character  as  a  unit  of  a  word,  which  consists  of  two 
or  three  characters. 

The  former  is  inconvenient  and  takes  a  long  time  to 
edit.  The  latter  has  no  flexibility  in  that  it  is  limited  by 
the  programmed  word  cedes.  To  solve  the  above  problem  and 
simplify  the  identification  of  each  character  using  a 
limited  number  of  keystrokes,  a  16-bit  code  for  Chinese 
characters  can  be  applied.  Table  V  represents  the  structure 
of  16-bit  code  for  Chinese  characters. 

Chinese  characters  represent  both  meaning  and 
phonetics  to  Koreans.  To  simplify  the  code,  all  the 
complete  meaning  and  sound  of  the  Chinese  characters  are  not 
needed.  The  Chinese  characters  are  composed  of  from  one  to 
five  syllables  for  meaning  and  one  character  for  the  sound. 
Simplicity  can  be  achieved  by  employing  abbreviations  or 
acronyms  for  each  part  (meaning  and  sound) .  For  example,  a 
Chinese  character  (J§_)  has  a  meaning  as  "Hea-Ven"  and  sound 
as  cheon.  In  this  case  we  use  H  of  Hea,  V  of  Ven,  and  C  of 
cheon  as  a  code  for  (  35.  1  . 
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TABLE  7 
Structure  of  16-bit  Code  for  Chinese  Characters 


1st  Byte- 


■>!< 2nd  Byte 


— > 


I   I   I 


I   I   l 


I   I   I 


I . 


<— 1st  letter-X — 1st  letter — X — 1st  lette 


of 

1st  meaning 
character 


of 

2nd  meaning 
character 


of 

3rd  meani 
characte 


r--> 

ng 

r 


*  1:  Chinese  character 


But  this  method  may  result  in  duplicate  codes  for 
different  Chinese  characters  which  mean  another  character 
and  may  have  the  same  value  as  HVC.  In  order  to  eliminate 
the  duplicate  code  and  to  use  the  3  letter  code  which  is 
compatible  with  the  16-bit  Korean  character  code,  the 
following  characteristics  of  the  sound  and  meaning  of 
Chinese  characters  are  relevant:  First,  only  428  syllables 
are  used  to  represent  the  sounds  for  all  Chinese  characters. 
That  is,  one  sound  can  represent  1  to  60  Chinese  characters. 
Second,  the  frequency  of  Korean  characters  used  for  the 
meaning  and  sound  is  irregular  in  distribution.  More 
specific,  20%  of  Korean  characters  are  used  to  represent  the 
sound  and  meaning  of  95%  of  Chinese  characters  [Ref.  11]- 

As  a  result  of  analyzing  the  1,300  sound  characters 
and  1,438  meaning  characters  used  to  represent  the  Chinese 
characters,  Table  VI  and  Table  VIII  are  derived.  Table  VI 
represents  the  number  of  Chinese  characters  which  have  the 
same  first  sound  letter  and  the  same  second  sound  letter. 
For  example,  266  Chinese  characters  have  the  first  sound 
letter  f  1  ) ,   44  Chinese  characters   have  (  *1  )       as  a  first 
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TABLE    VI 
Characters   Having   Same    1st   S    2nd   Letter   Sound 

2nd 
letter 

1st   letter 

1 

L 

C 

£ 

D 

fcl 

A 

0 

* 

X 

E 

IT 

o 

Group 

H 
H 

44 
12 

8 
6 

26 
8 

17 
4 

9 

20 
13 

27 
12 

62 
4 

25 

5 

17 

41 
11 

22 
7 

171 
8 

11 
2 

27 
9 
5 

H 

Group 

H 

Hi 
=l 
=41 

18 

1 

34 

15 

4 

1 

26 

12 

11 
12 

42 
6 

12 

43 
4 

59 
14 

28 
2 

— 

7 

6 

7 


18 
3 

Group 

J. 

JL 

34 

22 

1 

4 

8 

5 

2 

36 

13 

2 
3 

18 
5 

21 

28 

2 
1 

16 
7 

2 
13 

28 
4 

1 

15 
2 

9 
1 

10 

4 

24 
24 

8 
3 

T 
Group 

T 
7f 

32 



6 

3 
5 

— 

6 

5 
12 

16 

33 

__| 

44 
___ 
— 

22 
13 

14 
23 

22 

18 
— 6' 

5 
I 

4 

6 

1 

2 

4 

Group 

-1 

1 

20 

16 

1 
1 

4 

1 
13 

14 

19 

10 

37 

12 

10 
29 

8 
36 

4 
1  8 

1 

9 

3 

6 

Subtotal 

263 

27 

81 

105 

98 

135 

237 

265 

224 

122 

,1 

53 

148 

Total      1,800 
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sound  letter  and  ( )  as  a  2nd  sound  letter.  One  must  rear- 
range the  sound  acronym  to  the  5  bit  code  since  the  distri- 
bution of  sound  characters  is  irregular.  Table  VII  depicts 
the  rearranged  code  value  for  acronym  of  the  Chinese  sound 
character. 


TABLE  VII 
5-bit  Code  for  the  Acronym  of  Sound  Character 


code 

sound  acronym 

code 

sound  acronym 

00000 

■?v 

10000 

A 

00001 

ZL 

10001 

o\- 

00010 

& 

10010 

H 

00011 

i 

10011 

i 

00100 

SL 

10100 

o 
T 

00101 

L- 

10101 

o_ 

00110 

C 

101  10 

*V 

00111 

£• 

10111 

7A 

01000 

• 

11000 

£. 

01001 

□ 

11001 

T 

01010 

u- 

11010 

z*. 

01011 

y 

• 

1101  1 

X 

01100 

Ah 

11100 

E 

01101 

M 

11101 

UL 

01110 

/^ 

11110 

j. 

01111 

T 

11111 

o" 

• 

In  Table  VII  the  second  letter  (_t_/_d_/_:L /_T_/_r_) 
describes   all   the   group   letters.    For   example,    (  jrj 
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represents  H  .  ____,  ____,  _}___,  and  (_._}_.)  ;  __[_,  ____,  =i  . 
_J_l_,  and  (____-_)  ;  _-_-_,  _-____,  __x___,  _-_H_,  _-_-!_,  etc.  Also  (_H__) 
represents  _H__,  ___j group  which  assembles  the  first  conso- 
nants at  the  left  of  the  vowel.  (_!_)  describes  __-___,  _T_, 
and  _____  group  which  assembles  the  first  consonants  above  the 
vowel . 

Since  the  frequencies  of  Korean  character  syllables 
representing  Chinese  character's  sound  and  meaning  are 
different,  the  frequency  of  Korean  characters  to  represent 
Chinese  characters  meaning  is  needed  to  be  analyzed.  After 
analyzing  the  sampled  1,438  characters  which  are  the  first 
and  the  second  meaning  characters,  Table  VIII  is  derived 
which  shews  the  number  of  times  for  a  meaning  character  or  a 
group  to  be  used. 

The  meaning  acronym  value  to  a  5-bit  code  from  the 
basis  of  Table  VIII  can  be  reassigned.  Table  IK  shows  the 
reassigned  5-bit  codes  representing  the  acronym  of  the 
meaning  character.  The  same  theory  can  be  applied  as  in 
Table  VII  when  Table  IX  is  derived.  As  the  acronym  code  is 
rearranged,  the  proportion  of  the  duplicate  codes  can  be 
reduced.  As  a  result  of  applying  these  rearranged  codes, 
Table  X  can  be  produced  which  shows  the  proportion  of  the 
duplicate  codes.  The  pure  acronym  code  (Table  IX)  repre- 
sents the  acronym  of  a  meaning  and  a  sound  character  as  a 
first  letter  code  of  Korean  characters  (__7  T7.j_._C_  cc.  _e_  n_ 
____.__.  A  _?_  __.  2__  Z2__S.iL  JLiL  __L;  15  possible  conso- 
nants), the  arranged  sound  character  acronym  code  (Table 
VII) ,  and  the  arranged  sound  and  meaning  character  acronym 
code  (Table  VIII) . 

The  reasons  why  some  duplicate  codes  cannot  be  elim- 
inated are:  First,  some  Chinese  characters  have  similar 
meaning  and  sound  which  generates  the  same  acronym  code  (22 
among  1,800);  and  second,  there  are  initially  some  Chinese 
characters   which  have   the  same   meaning  sound   (  12   among 
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TABLE  VIII 
Freguency  of  Meaning  Character  in  Chinese  Characters 


2nd  letter 

1st 

It 

h  group 

^  group 

-L  group 

Tgroup 

—  grou  p 

TOT 

236 
23 
73 

103' 
13 

107 

93 

89 

9 

107 
10 

246 
93 
8 
40 
14 
23 
17 
69 

1438 

-J 

TJ 

u 
c 
cc 
e. 
a 

d 

A 

M 
o 

X 

2* 
X 

£ 
IL 

JL. 

o 

h 

42 

1 

23 

23 

1 

10 

39 

24 

2 

32 

5 

28 

18 

2 

5 

2 

8 

4 

32 

301 

H 
6 
4 
8 
9 
1 
6 

13 
5 
2 

11 

6 

5 

4 
1 
3 
2 
2 

87 

25 
2 
7 

12 
3 
5 

10 

11 
1 

21 
1 

29 
8 

6 

4 
1 

2 

149 

9 

2 
1 

8 

4 
2 

2 

I- 

=*l 

22 

3 

7 

3 

13 

2 

— 

19 

1 

70 

2 

9 

24 

10 
17 
11 

15 
15 
15 
6 
1 
3 
1 
3 
1 
6 

143 

2 

1 

1 

7 
16 

6 
1 
1 
1 

2 

2 

4 
2 
2 

19 

T  JTt 

27  |  3 

4|  1 
71 
171 
2] 

4| 
421! 
15| 

..... 

17] 

—  i" 

22  |  4 

-li'fl 
..... 

-;- 

1 
"2\- 

"IV 

182  8 

9 

1 

2 
1 

1 
2 

1 

17 

26 
4 

13 

11 
1 

28 
1 

1 

11 

4 

57 
3 

5 
9 
2 
1 
2 

187 

I 

30 
4 
2 
3 
1 

31 
4 

10 

19 

55 

39 

4 

13 

1 

— 

228; 

*  2nd  letters  having  10  are  omitted  for  simplicity 


30 


TABLE  IX 
5-bit  Code  for  Acronym  of  Meaning  Character 


code 

1st  8  2nd 
meaning  acronym 

code 

1st  S  2nd 
meaning  acronym 

00000 

10000 

A* 

00001 

7V 

10001 

• 

00010 

7\ 

10010 

oh 

00011 

JL 

1001  1 

H 

C0100 

+        I 

10100 

5L 

00101 

JL 

10101 

o 

T 

00110 

L- 

101  10 

^ 

00111 

V        I 

10111 

*• 

01000 

o 

11000 

• 

01001 

c 

• 

11001 

„ 

01010 

£• 

11010  | 

I      ~        ~ 

=? 

01011 

a 

• 

1101  1 

„ 

i  — —  _ _ _   _     _— 

01  100 

Q« 

11100 

I          X 

01101 

a 

• 

11101 

_   _               _     _ 

01110 

H« 

11110 

, JL 

01111 

• 

11111 

t  -  ------ 

1,800  ).  For  these  characters  which  have  duplicate  codes 
the  users  must  apply  the  exception  rule.  Alternatively,  the 
meaning  cf  the  character  can  be  redefined  to  a  synonym  with 
a  different  acronym.  For  example,  if  the  wrong  homonymous 
Chinese  character  is  displayed,  the  input  operator  may 
select  another  form  cf  the  homonym  by  keying  in  the  full 
sound  syllables  instead  of  the  acronym. 


31 


TABLE  X 
Proportion  of  Duplicate  Code 


Number  of 
character 


1,80 


pure 
acronym 
code 


7.55? 


Rearranged 
sound  cha-    | 
racter   code 


Rearranged    sound 

and   meaning 
character    code 


2.3?? 


1.8  7- 


-  •       16-hit    Code    for    Roman    Alp_ha bet    and    Syjnbo Is 

In  order  to  use  the  three  mixed  kinds  of  a  character 
code,  and  simplify  the  I/O  controller,  and  unify  the  word, 
16-bit  codes  for  the  Roman  alphabet  and  symbols  should  be 
generated  by  only  one  keystroke.  For  data  communication  and 
for  familiarity,  adding  only  the  default  byte  (00H)  to  ASCII 
code,  16-bit  code  for  Roman  alphabet,  symbols,  and  control 
characters  can  be  defined.  When  one  uses  only  Roman  alphanu- 
meric characters,  one  can  easily  convert  this  16-bit  cede  to 
ASCII  code.  Table  XI  shows  the  16-bit  code  for  Foraan 
alphabet   and  symbols. 

4 .      Keyboard    for    16 -bit    Code 

As  it  is  mentioned  in  the  previous  chapter,  the 
biggest  issue  is  how  to  enter  ail  Chinese  characters,  Korean 
characters,  and  the  Roman  alphabets  with  a  simple  keyboard. 
In  order  to  implement  the  16-bit  code  to  keyboard,  one  would 
have  to  make  the  keystrokes  which  generates  "1"  or  "0"  as  a 
Chinese  character  function  key  (bit  1)  ,  three  5-bit  codes 
(00000-11111)  for  Chinese  and  Korean  characters  (bits  2-16) 
and  16-bit  code  for  Roman  alphabet  and  symbols.  In  this  case 
33    more      keys   than      the   common      Roman    alphabet      keyboard    are 
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TABLE  XI 
Structure  of  16-bit  Code  for  Roman  Alphabet 

< 1st  Eyte >|< 2nd  Byte > 


< 00H >  < ASCII  Value 

for  Roman  alphabet 


> 


needed.  To  reduce  the  number  of  keys,  one  more  function  key 
can  te  added  for  the  Roman  alphabet  which  generates  16-bit 
code  as  a  5-bit  code  key.  However,  the  user  identification 
will  be  complex  because  there  must  be  4  or  5  letters  on  each 
key  top.  Table  XII  explains  the  4  alternatives.  Alternative 
I  includes  32  Roman  alphabet  on  5-bit  key  tops  using  a  Roman 
alphabet  function  key  and  Alternative  II  excludes  Roman 
alphabet  on  5-bit  key  top.  Alternative  A  uses  the  acronym  of 
sound  and  meaning  characters  and  Alternative  B  uses  only  the 
acronym  of  sound  character  for  the  sound  and  meaning 
characters. 

To  select  the  best  one,  the  authors  can  use  the 
following  criteria:  flexibility  of  hardware  and  software 
design,  hardware  efficiency,  ease  of  maintenance,  system 
reliabilty,  user  characteristics,  number  of  keystrokes, 
number  of  duplicate  cedes,  and  complexity  of  recognizing  a 
certain  keystroke  [Ref.  2].  In  the  opinion  of  authors' 
alternative  II-B  should  be  selected  for  the  ease  of  explana- 
tion and  understanding.  By  selecting  Alternative  II-B,  a 
user  can  type  the  three  characters  simultaneously.  For 
example,  to  type  "School  is  JS_JL  (Hak-kyo)  in  Korean  and 
JfcLM.  (Hak-kyo)  in  Chinese  character",  one  can  type  directly 
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TABLE  XII 
Leveled  Letters  on  32  Key  Tops 


Alternative  I 

I-A  I-E 


1 


2  |3  14 


16 


Legend : 


Alternative  II 
II-A            II-B 
+ 


1 


2  |3  |4 


+ + 

2   3   4 


5   |6 


2   3   4 


Roman  alphabet 

First  letter  of  Korean  character 
Second  letter  of  Korean  character 
Third  letter  of  Korean  character 
Acrcnyra  of  Chinese  sound  character 
Acronym  of  Chinese  meaning  character 


only  Reman  alphabets  in  the  previous  sentence  without  a 
function  key.  Two  syllables  of  "school"  are  formed  from:  In 
Korean,  the  first  syllable  is  "_o_"  selected  from  the  posi- 
tion of  first  sound  letter,  "_^_"  from  the  second  sound  posi- 
tion, "JL"  from  the  third  position.  The  second  syllable  is 
"Jl,  ^L,  Default".  Then,  in  Chinese,  press  the  Chinese  char- 
acter function  key  which  generates  "1"  as  the  first  tit.  The 
first  syllable  is  " Jj^"  which  is  the  acronym  of  first  meaning 
character  and  "JL"  which  is  the  acronym  of  second  meaning 
character  and  "j£_"  which  is  the  acronym  of  sound  character. 
The  second  syllable  is  "  oH  Jl  it".  After  typing  Chinese 
characters,  user  must  release  the  function  key  to  type 
Korean  characters.  Table  XIII  explains  the  above  example. 

As   a  result   of  the   above   example,   the   computer 
generates   the   following  codes   in   hexadecimal:    0053(S), 
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r  ■  ■■ 

TABLE  XIII 
Typing  Procedures  for  Mixed  Characters 

Tc 

type  "school,  t£  H,  ^  /£•$;",  the  following 
procedures  should  be  follwed: 

1. 

Type  "schocl"  by  one  keystroke  for  each  character 
without  a  function  key. 

2. 

In  Korean,  to  type  "_*£  JjL", 
first,  type  "_o,  Jl,  J_" , 
second,  type  "Jl,  it ,  Default". 

3. 

In  Chinese,  to  type  "j5£  ^£",  at  first,  press 

the  function  key,  then  type  '•^•,  t,  o«  » , 

"o*,  JL,  JL"  (Table  IX)  since  for  "^P" , 

the  meaing  is  "J^  Jt"  and  the  sound  is  "S_",  and 

for  ">$&",  the  meaning  is  "_5l  i~_"  an^  the   sound 

is  "iL". 

. 

0063(c),  0068(h),  006F(o),  006F(o),  006C  (1)  ,  and  0  (Korean 
character),  11111  (JfL)  #   00010  (J*_),   0000  1  (JL)  I   that  is  °111 

1100  0100  0001  (  7C  4 1 :  »■£  )  .  0  (Korean  character),  00001  (J_)  , 
10010  (ii),  00000  (Default)  ;  that  is  0000  0110  0100  C000(  06 
a0:J?:J,  and  1  (Chinese  character),  01010  (Ml)/  10100  (jh)  , 
11110  (©i);  that  is  1010  1010  1001  1110(  AA  9E:j£_),  and 
1  (Chinese  character)  ,  11110  (©i),  00010  (JL),  11110(JL);  that 
is  1111  1000  0101  1 1  10  {  F8  5E:j&_). 

5  •   Operating  System  for  Input  an d  Output 

To  apply  the  suggested  system,  it  is  needed  to  rede- 
sign  the  operating   system  for   input   and  output   control. 
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Read   from 
keyboard 


Display 
on  CRT 

and 
send  to 

buf f or 


I 


Road   froi 
keyboard 


D  i  G  p  1  a  y 
on  CRT 

a '.  i  d 
send  to 

buffer 


la. 


FR :  a  flag  register  for 
Chinese  character 


Figure  4.3    Flowchart  of  Input  and  Output  Controller. 
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Figure  4.3  shows  the  flowchart  of  input  and  output  control. 
First,  the  input  and  output  controller  has  to  distinguish 
whether  the  Chinese  character  function  key  is  "0"  or  "1".  A 
flag  register  can  be  used  for  Chinese  character  function 
key.  For  example,  if  the  flag  is  "1",  then  "1"  is  loaded  to 
the  first  of  the  16  bit  register,  and  multiple  5-bit  codes 
are  read  until  the  register  is  full.  If  it  is  full,  a  char- 
acter is  displayed  on  the  CRT,  and  the  16-bit  code  is  sent 
to  a  buffer  as  data.  Otherwise  the  flag  is  "0",  and  then 
"0"  is  leaded  into  the  register,  and  5-bit  codes  are  read 
for  a  Korean  character.  A  16-bit  code  is  used  for  a  Roman 
alphabet  character  or  a  symbol  code  until  the  16- tit 
register  is  full.  If  the  16-bit  register  is  full,  the  iden- 
tified character  is  displayed  and  sent  to  a  buffer  as  data. 
If  the  "stop  edit?"  condition  shown  in  Fig  4.3  is  "no",  the 
input  and  output  controller  makes  a  loop  to  read  a  code, 
displays  a  character  and  sends  a  character  code  to  a  buffer. 
This  system  will  make  the  use  of  Korean  language 
commands  and  programs  easier  to  use  than  those  presently 
available.  To  achieve  the "above  goals,  a  compiler  and  inter- 
preter, as  well  as  the  operating  system  will  require  rede- 
signing. This  system  will  require  the  complete  rewriting  of 
all  software  currently  used.  The  economic  impact  of  this  on 
the  Korean  people  will  be  enormous. 

6  •   Design  Considerations  for  Character  Genera ti on 

There  are  twe  shapes  of  characters  used  in  Korea: 
Gothic  (Figure  4.4)  and  Brush  type  (i.e.,  Ming  style:  Figure 
4.5)  [Ref.  7:  p.  34].  To  generate  the  above  shapes  of  char- 
acters, several  methods  of  a  character  generation  can  be 
considered.  To  select  the  best  method  for  Korean  and  Chinese 
characters,  one  can  use  the  following  five  criteria:  speed, 
space,  quality,  flexibilty,  and  cost.  Speed  is  a  double 
standard:   speed  of  creation  may  range  from  a  few  minutes  to 
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Figure  4.4    An  Example  of  Gothic  Type 


Figure  4.5    An  Example  of  Brush  Type. 

a  few  hours,  while  speed  of  production  should  go  teycni  1000 
characters/sec  depending  on  type  size  and  device  resolution. 
Space  refers  to  the  average  size  of  the  code  for  one  char- 
acter as  well  as  the  size  of  the  internal  buffers  often 
needed  for  decoding.  Quality  is  proportional  to  the  largest 
dot  matrix  which  can  be  used  to  decode  a  character;  it 
should  net  be  confused  with  the  resolution  of  the  output 
device.  For  a  given  type  size,  the  resolution  sets  the  defi- 
nition, that  is  the  size  of  the  matrix  to  be  used;  defini- 
tion, hence  type  size,  is  bounded  by  the  quality  of  the 
code.  Flexibilty  refers  to  the  different  automatic  modifica- 
tions which  are  supported  by  the  code;  scaling,  rotating, 
family  variations  (as  going  from  light  to  bold).  Cost  is 
self  explanatory  [Ref.  12:  p.  240]. 
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Cbviously  the  five  criteria  above  are  not  indepen- 
dent. Figure  4.6  shows  the  interrelationship  of  criteria  in 
designing  a  character  generator  [Ref.  12:  p.  211].  The  most 
desirable  feature  is  indicated  by  the  direction  of  the 
arrow.  Solid  (resp.   dotted)  lines  indicate  agreement  (resp. 
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Figure  4.6    Criteria  in  Designing  a  Character  Generator. 

contrariety)  between  the  variation  of  the  factors.  The 
design  of  a  digital  character  generator  is  an  engineer's 
task  whose  goal  is  to  strike  the  appropriate  balance  between 
the  specifications  for  those  five  criteria  combined  with  the 
characteristics  of  the  production  device,  resolution  and 
scanning,  and  the  necessity  of  operating  the  corresponding 
creation  station. 

Table  XIV  [Ref.  12:  p.  268]  gives  a  summary  of  the 
main  characteristics  of  the  coding  methods  that  the  engineer 
can  utilize  [Ref.  12:  p.  269].  As  the  characteristics  of 
Korean  and  Chinese  characters  are  compared  to  Table  XIV,  it 
should   he  apparent   that  the   bit   map  method   is  the   most 
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TABLE  XIV 
Comparative  Table  for  Performance 


Method 

Code  space 
(bits) 

Buffer 
space 
(bits) 

Rlex- 
bility 

Video 
scan 

Peso- 
lut  ion| 
(n) 

Qty 

Spd 

Bit 
map 

2 

n 

0 

+  + 

<5  0 

+  +  + 

Run- 
length 

k*n*log  n 
2 

n 

+  + 

<100 

++ 

Chain- 
link 

6*k*n 

2 

n 

<100 

+ 

Diffe- 
renti- 
al run 
length 

6*k*n  + 
b (log  n+c) 
2 

n 

_ 

+ 

<100 

— 

+ 

Spline 

k*log  n 
2 

k  *log  n 
m    2 

Struc- 
tural 

k'*log  n 
2 

2 

n 

i 

.*  Qty:  Quality 
*  Spd:  Speed 


Legend : 


+  : 

Good 

—  • 

Bad 

b: 

The 

c: 

A  co 

n: 

The 

k: 

The 

line 

of  t 

for 

10  f 

numer  of  birth  point  in  the  character 
nstant  taking  care  of  bookkeeping 
size  of  matrix 
average  number  of  runs  per  matrix 

cr  column  (a  number  of  the  simplicity 
he  character  shapes:  approximately  U 
Reman  body-text  fonts,  higher  than 
or  Chinese  characters 


appropriate  one  for  this  application.  It  reduces  code  space 
and  buffer  space.  It  has  good  video  scan,  high  speed,  and 
highly  readable  low  guality  printing.  Unfortunately  this 
method  lacks  flexibility.  However  for  all  the  other  afore- 
mentioned reasons,  the  bit  map  method  is  commonly  used  for 
Korean  and  Chinese  characters. 
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Eecause  of  the  complexity  of  Korean  and  Chinese 
characters,  at  least  a  16  by  16  resolution  is  required  for 
Korean  characters,  and  a  24  by  24  resolution  for  Chinese 
characters.  32  by  32,  64  by  64,  80  by  80,  96  by  96,  and  128 
by  128  resolutions  are  desirable  when  much  more  beauty  is 
required  and  also  when  larger  character  sizes  are  to  be 
produced.  However,  if  these  characters  are  displayed  on  a 
CRT,  with  32  by  32  resolution,  with  the  size  of  each  char- 
acter 7-10  mm  square,  this  should  be  sufficient. 

It  is  the  authors1  opinion  that  the  less  expensive 
32  by  32  resolution  CRT  should  be  used  for  softcopy.  The 
reason  fcr  this  is  that  the  price  of  the  memory  component 
required  to  hold  the  character  definitions  is  continually 
getting  less  expensive.  However  stronger  motivation  is  that 
high-speed  and  flexibility  of  typing  is  then  possible 
[Ref.  1:  p.  828].  IEM  corporation  uses  16  by  16  resolution 
for  Gothic  and  24  by  24  for  Brush  type  Korean  character 
syllables  [Eef.  8:  p.  2].  FACON  corporation  uses  30  ty  30 
dots  for  Korean  and  Chinese  characters  and  24  by  3  0  dots  for 
Roman  alphabet,  symbols,  and  Korean  alphabet  (letters)  on  a 
laser  printer  [Ref.  7:  p.  47].  As  cheaper  dot  matrix  and 
laser  printers  find  their  way  into  the  marketplace,  the 
quality  of  characters  will  become  less  of  an  issue. 
Presently  there  are  few  problems  with  the  quality  of  repre- 
senting Korean  characters  that  cannot  be  solved  through  the 
additional  expenditure  of  money.  For  the  definition  of  each 
character,  the  authors  have  presented  two  alternatives; 
software  and  hardware  (character  generator) .  In  order  to 
increase  speed  and  usability,  a  hardware-oriented  character 
generator  is  best.  If  cost  and  flexibility  are  the 
criterion,  software-oriented  character  definition  programs 
are  tetter. 
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7  .   Memory  Space  for  Ch a ra ct er  Definition 

To  represent  Korean  and  Chinese  characters,  one 
needs  to  code  5,000  characters  for  a  character  definition: 
(2,400  Korean  characters;  1,800  Chinese  characters;  800  user 
definable  characters,  Foman  alphabet,  or  symbols).  If  one 
uses  a  24  bit  by  24  bit  matrix  font  size  for  each  character, 
at  least  360  K  bytes  are  required  (3  byte  *  24  *  5,000)  for 
character  definition  and  128  K  bytes  are  required  (64  K  for 
16  bit  address  *  2  byte  for  an  address  of  character  defini- 
tion memory)  for  a  look-up  table.  The  total  memory  require- 
ment is  488  K  bytes. 

A  large  memory  space  is  required  for  the  definition 
of  the  characters.  Data  compression  of  these  characters  can 
be  considered  for  two  different  purposes:  data  transmission, 
and  computer  storage  and  output.  Here  one  is  mainly  inter- 
ested in  the  latter  case,  where  the  main  point  is  the  total 
data  amount  to  be  stored.  The  method  of  data  compression  of 
Chinese  characters  can  be  classified  by  using  the  method- 
ology listed  in  Table  XV  [Ref.  1:  p.  820]. 

There  is  a  problem  associated  with  the  enlargement 
and  alignment  of  character  patterns.  The  clarity  of  a  char- 
acter deperds  on  the  size  of  the  reproduction.  If  a  large 
size  is  required  the  resolution  must  be  high.  Otherwise, 
stepwise  zigzags  appear  which  to  some  people  are  unbearable. 
Therefore,  all  the  patterns  of  different  font  sizes  must  be 
stored.  This  is  uneconomical.  Reproducing  different  char- 
acter sizes  from  the  same  data  is  desired.  However,  the 
enlargement  and  shrinking  of  character  patterns  frcm  a 
single  set  of  data  is  quite  difficult,  because,  if  the  addi- 
tion or  the  deletion  of  a  bit  by  the  interpolation  is  not 
done  properly,  it  has  a  negative  influence  on  the  asthetics. 
In  enlargement,  the  smoothness  of  an  edge  is  particularly 
important,  while  in  shrinking  the  gap  between  strokes  must 
be  carefully  maintained. 
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TABLE  XV 
Varieties  of  Data  Compression  Methods 


transmission 


run-length  coding 

ft wo- dimensional 
predictive  coding 

coding  by  scan  line 
pattern  unit 

coding  by  m  by  n  block 
representation  ^pattern  unit 

checker  board  sampling 


page  unit 
character  unit 


dot  pattern 


memory 
reconstruction  \ 


enlarge/shrink 


stroke 
representation 

ccntour  coor- 
dinate coding 

ccntour  fol- 
lowing coding 


.hexagonal  board  sampling 


mathematical  equation 
for  strokes 
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A  comparative  review  of  the  options  contained  in 
Table  XV  with  regard  to  determining  memory  size  is  very 
difficult.  This  is  because  the  requirements  for  character 
print  qualities  are  quite  different  depending  on  each 
method.  Simplicity  in  the  hardware  and  software  implementa- 
tion of  the  compression  and  reconstruction  of  characters  is 
a  very  important  consideration.  Generally  speaking  high 
data  compression  methods  need  complex  hardware  and  longer 
times  for  reconstruction.  Therefore,  the  tradeoff  to  be 
considered  is  between  the  data  compression  ratio  and  the 
memory  size.  This  represents  the  classic  economic  tradeoff 
between  the  hardware/software  cost  with  regard  to  the  speed 
of  character  regeneration. 
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Because  the  price  of  the  memory  component  is 
becoming  less  expensive  the  high-speed  simple  reconstruction 
method  is  preferred  despite  the  necessarily  large  size 
memory.  Many  commercial  machines  have  adopted  this  concept, 
and  store  the  character  dot  patterns  as  they  are  without  any 
data  compression.  For  example,  IBM  machines  use  only  a  12  by 
24  font  size  for  simple  letters  (Roman  and  Korean  alphabet 
and  symbols)  instead  of  a  24  by  24  font  [Ref.  8i  p.  17]. 
The  FACOH  machines  use  the  software  definition  of  the  second 
level  of  Korean  and  Chinese  characters  which  are  not  used 
frequently  for  data  compression  [Ref.  7:  p.  12].  Because  of 
the  reduction  in  the  price  of  memory,  the  marketplace  has 
shifted  towards  providing  direct  character  storage,  i.e.  a 
large  memory,  instead  of  utilizing  data  compression. 
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v-  IIALOaTION  OF  SUGGESTED  METHODS 

The  principal  protlems  in  current  editing  technology  for 
Chinese  and  Korean  characters  were  detailed  in  Chapter  III. 
Fundamentally,  the  problems  cause  user  inconvenience, 
require  lengthy  input  procedures,  and  result  in  complex 
update  requirements.  The  authors'  suggested  methods  will 
solve  most  problems  which  are  encountered  in  Korean  language 
data  processing.  More  research  and  development  remain  in  the 
following  areas: 

First,  in  an  enumeration  method,  there  is  nc  problem 
except  low  readability  to  Koreans  and  the  inability  to 
represent  Chinese  characters.  In  this  case,  Chinese  charac- 
ters are  ignored  because  Korean  language  data  can  be  repre- 
sented through  the  use  of  only  Korean  characters  without 
sericus  problems.  Low  readability  is  caused  by  unf amiliarity 
and  the  unbalanced  shape  of  each  letter  when  written  by  an 
enumeration  method.  With  a  minor  change  of  shape  of  the 
letters,  this  method  will  eliminate  the  above  problems. 

Second,  the  16-bit  code  for  the  three  kinds  of  charac- 
ters requires  the  consideration  of  the  following  problems: 

1.  The  32  key  tops  are  complex  since  each  key  top  repre- 
sents three  or  four  letters  and  acronyms.  One  solu- 
tion to  this  problem  is  to  use  lighted,  changeable 
key  tops  which  represent  only  one  letter  or  acronym 
at  a  moment  according  to  the  function  keys  and  the 
order  of  keystrokes(  i.e.,  1st,  2nd,  3rd  letter  and 
acronym  ) . 

2.  The  user  must  remember  whether  a  letter  to  be  typed 
is  the  first,  second,  or  third  letter,  and  whether  it 
is  an  acronym  of  a  sound  or  a  meaning  character. 
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3.  If  a  user  does  not  know  the  meaning  of  a  certain 
Chinese  character  to  be  typed,  one  must  look  up  a 
table  which  shows  all  meanings  and  sounds  of  all 
possible  Chinese  characters. 

4.  In  typing  Korean  characters  which  consist  cf  only 
first  and  second  letters,  a  user  has  to  hit  the 
default  key  to  make  a  16-bit  code.  Instead  of  second 
letter  and  default  keys,  one  can  use  twenty  one  more 
second  letter  keys  which  generate  10-bit  code  as  the 
second  and  third  letters.  Unfortunately ,  this  will 
make  the  keyboard  more  complex. 

5.  Regardless  of  the  authors'  analysis  of  sound  and 
meaning  characters  and  careful  rearrangement  of  these 
codes,  duplicate  codes  still  exist.  This  is  because 
of  the  irregularities  caused  by  a  natural  evolution 
cf  sound  and  meaning  characters  for  over  2,000  years. 
Generally  3,000  or  more  Chinese  characters  will  cause 
duplicate  codes  to  increase  proportionally.  In  order 
to  eliminate  the  duplicate  codes,  the  Korean  language 
committee  needs  to  take  measures  to  clarify  the  mean- 
ings of  the  Chinese  characters  that  cause  duplicate 
codes  to  exist. 

Before  the  actual  construction  of  the  suggested  system, 
an  economic  (Cost/Benefit)  analysis  needs  to  be  considered. 
Given  the  r  %  discount  rate  and  the  various  yearly  costs  and 
benefits  estimated  by  past  data,  Table  XVI  [Ref.  14]  shows 
the  following  formula  which  can  be  used  to  derive  the  net 
present  value  of  this  project:  This  simply  states  that  the 
net  present  value  (NFV)  is  equal  to  the  sum  of  the  differ- 
ences between  benefits  (B)  and  costs  (C)  in  each  year  (i)  of 
the  project  life  (T) ,  divided  by  the  relevant  factcr  (r)  for 
that  year.  The  current  estimate  of  the  market  size  for  word 
processing  in  Korea  is  $  2.5  million  annually  (Korean  Daily 
Times,  Sep  10  1934).    But  this  estimate   will  be  in  inverse 
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TABLE  XVI 
Net  Present  Value  Formula 


<3-  "  c-> 

T      11 

NPV  =  Z 

i=1        i 

(r) 


Legend:    NPV:  Net  Present  Value 

B:  Benefit 

C:  Cost 

i:  Each  year 

T:  Project  life 

r:  P.elavent  factor 


proportion  to  the  price  of  the  system  and  will  be  in  direct 
proportion  to  the  usefulness  and  the  user-friendliness  until 
maturation. 

In  the  above  formula,  the  market  price  of  the  system 
influences  the  benefits  for  manufacturers  and  costs  for 
users.  This  system  is  feasible  when  the  net  present  values 
are  positive  for  both  manufacturers  and  users.  If  the 
benefits  for  manufacturers  and  the  costs  for  users  are 
constant  in  a  system,  the  main  problems  will  be: 

1.  How  to  minimize  the  costs  for  manufacturers 

2.  How  to  maximize  the  benefits  for  users 

To  solve  the   above  problems,   the  best  approach   will  te  to 
make   an  efficient   and  user   friendly   system   for   Korean 
language  data  processing.   This  will  increase  the  number  (N) 
of  systems  sold,  and  increase  the  individual  productivity  of 
the  users. 

There  are  many  factors  and  constraints  which  cause  high 
cost  in  implementing  this  method.  Among  these,  the  following 


47 


three  factors   affect  the   cost  performance   ratio  for   both 
manufacturers  and  users: 

1.  The  initial  design  cost:  For  this  system,  an  organi- 
zation has  to  invest  initially  for  the  design  of 
about  5,000  Korean  and  Chinese  character  patterns, 
and  the  system  software  and  hardware.  As  the  number 
of  systems  produced  by  a  manufacturer  is  increased, 
the  unit  cost  of  each  system  will  be  decreased  as  the 
costs  are  spread  over  more  units. 

2.  Cost  for  character  generator:  As  mentioned  in  Chapter 
IV,  one  needs  about  500  K  bytes  memory  capacity  for 
these  character  definitions.  The  cost  of  memory  is 
decreasing  and  the  speed  is  increasing  as  technology 
is  being  develcped.  This  cost  is  an  initial  cost  to 
users  when  buying  a  system. 

3.  Cost  for  hardcopy:  One  can  consider  three  kinds  of 
printer  for  hardcopy:  dot  matrix  printer,  chain 
printer,  and  laser  printer.  It  is  not  practical  to 
use  a  chain  printer  for  our  system  since  the  chain 
will  be  approximately  twenty  meters  long  (  5,000 
character  *  4  mm  per  each  character)  and  it  would  be 
prohibitively  slow.  Currently,  dot  matrix  printers 
and  laser  printers  cost  more  than  chain  printers,  but 
they  are  the  only  viable  option. 

Among  the  three  kinds  of  cost,  the  third  one  is  the  most 
serious  since  the  cost  of  hardcopy  is  increasing  as  its  use 
increases.  Recently  laser  printers  have  become  more  popular 
for  these  characters  because  of  the  good  quality,  high  speed 
and  decreasing  price.  Comparatively  though,  laser  printers 
are  still  relatively  expensive. 
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VI.  RECOMMENDATION  AND  CONCLCJSIOII 

As  the  demand  for  data  processing  in  Korea  increases, 
users  will  continue  to  encounter  more  and  more  problems  in 
utilizing  the  Korean  language  for  data  processing.  The 
current  methods  of  convergence,  display  and  select  to  imple- 
ment the  Korean  language  in  data  processing  must  only  be 
considered  as  interim  measures  due  to  their  inefficient  and 
time  consuming  means  cf  data  entry.  In  order  to  prevent  this 
problem  from  becoming  more  complicated  due  to  the  develop- 
ment of  various  new  implementations  forwarded  by  independent 
research,  a  standardized  system  must  be  developed. 

This  study  examined  two  possible  solutions  for  using  the 
Korean  language  in  data  processing.  The  enumeration  method 
is  technologically  feasible,  inexpensive,  easy  to  implement, 
but  could  not  be  used  for  applications  within  the  Korean 
data  processing  environment.  This  is  because  it  results  in  a 
textual  form  of  Hangul  that  is  unfamiliar  to  most  Korean 
people.  Therefore  the  current  enumeration  method  is  not  a 
feasible  solution  to  the  Korean  data  processing  problem. 

The  second  method  examined  was  based  on  a  16-tit  cede 
representation  of  Korean,  Chinese  characters,  and  the  Roman 
alphabet.  This  method  was  found  to  possess  all  the  advan- 
tages currently  realized  by  the  EBCDIC  or  ASCII  code  repre- 
sentation of  western  countries.  The  only  drawback  to  this 
system  is  that  it  might  not  be  cost  effective  based  on 
current  technology.  However,  due  to  the  rapid  development  of 
hardware  and  software  technology,  a  cost  effective  means 
should  be  available  within  the  next  few  years. 

In  order  to  accelerate  the  determination  of  a  thorough 
broad  based  solution  to  the  Korean  data  processing  problem, 
the  Korean   government  must  organize   and  charge   a  national 
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level  committee  with  the  responsibility  for  investigating 
the  problem  and  determining  a  viable  solution.  This  study 
with  its  proposal  cf  a  16-bit  character  code  should  be 
provided  to  that  committee  for  further  examination.  This 
proposal  represents  a  concept  that  could  eventually  lead  to 
a  long  term  viable  solution  to  the  data  entry  and  processing 
problems  of  using  the  Korean  language. 
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APPENDIX  A 
THE  EVOLUTION  OF  CHINESE  CHARACTEES 
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APPENDIX    B 
EBCDIC    INPUT    CODE 
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APPENDIX  C 
MDS  INPDT  CODE 
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APPENDIX  D 
IDH  MD1TISTATI0N  5550  KEYBOARD 
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APPENDIX  E 
FACCfl  OS  IV  (KEF)  KEYBOARD 
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APPENDIX  F 
LOAD  COMMAND  PROGRAM  FOR  CORRENT  KEYBOAR® 
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APPENDIX  G 
LOAD  COMMAND  PROGRAM  FOR  CAPITAL  LETTER  KEYBOARD 


l(a- 

]  tt    K 

1  ■  !  - 
1"*" 

lM" 

1"? 

1"' 

1", 

1". 

1"/ 

1M0 

1-: 

1-2 
1-3 

1»4 

1-5 
1-6 
1-7 

1-3 
1-9 

l": 

1-2 

l-a 

1  "  3 

1-C 

l-n 
l"? 

1"F 
i-G 
l"M 

1  "P 
1-S 
1"T 
i  -v 

1"  M 

l-x 

i"Y 

1"Z 

i  »* 

1-a 
l"c 

1  -d1 
I"?1 


lee") 

oo; 

0000007F 
00000000 
00000008 
00000000 
00000000 
00  0  00039 
00000005 
000CC009 
0C0C0O05 
00FF0101 
007F41 41 
007F3808 
007F0C24 
00494949 
00000040 
00003000 
000000  0  0 
00G00001 
0C000001 
OOOOOGOC 
0C000000 
0C300075 
00  3  00000 
30000COO 
0000001C 
30000077 
00000077 
00000030 
00000000 
3000001C 
00000000 
0000  3  000 
00  3  00000 

ocoooooo 

3000000C 

ocoooooo 

00000000 
OC000006 
307F4040 
007F0000 
007F0131 
001C007F 


01010101020c; 

7F0314224141  ; 
3E483E093E08  : 
007F0022227F  ; 
00001813; 
390979090909 
052527 70 05 3 5 
39793  9  793  9  09 
7D1525504545 
0101010101C1 
4  1  4  1  4  1  4  1  4  1  7  p 
142241414141 
24242424  247F 
4  9  7  F  <♦  9  4  9  <♦  9  7  F 
4040  7E40404C 
00  0  80808037F 
OOOCOO  307F  ; 
0.1  3101030579 
701121 2741^1 
7C414  141  4  1  7F 
7F4040404C  7F 
15  1577-5-5  77 
7F30  7F-0407F 
7F013C0102GC 
007F03 142241 
093909112244 
151575454577 
77  1111714172 
404040404 0  7F 
007F00  3E-1  3E 
0408  1  4  Z2<*  l  41 
1Z12324C494S 
721212254949 
41  *  1  7F41 4  1  7P 

1C224141221C 
771015  7540  77 
7F J10  1  7F^07F 
4G4F4C464976 
40-0404040  7F 
7F -040404  0  7F 
013F0101C10  1 
33  1422^14141 


l-k"000000004444447C4444 
1-1-00OO0CO1017F09091161 
l-m-000000007F 1414 141414 
l-n"00000002023E023E0202 
1-O-000000011111U7F0101 
1-D-0  03003040424277C0404 
l"q-0040404040404G40407F 
l-r-0ClC007F00lC224122  1C 
1-S-0C010204C8 142241 4141 
l"t" 0C7F242424344A-94949 
l"u"300000000C1414  14147F 
l*v"  G0774<,<.<«<»4-444444477 
l"iu"  00414141417F4141417F 
1-X-001C224141  <*141  4122  1C 
l-y-00000040407C4C7C404C 
1-J-007F010  1C17F4040407F 
l"t-000000  10  30  50  1010107C 
1"(M0000  3  07F02040E0141 3E 
1")"  00Q000040C  1<«24  7F0404 
1-^«0000003C42010E30407E 

1-1" 0000001C22405E51211E 

1  -  U  "  OC003C7F^O-:65101413E 

I'"  '000C22147F1422: 

1"«-"0016130CQ0131£: 

1«-"0000307F ; 

i-:-0000030o7F0B08: 

1 "<-000106  1860 130601  ; 

1-=" 0OOC0C000O0C0810; 

1">-0C20 180601 0618  20; 

1 -7-003E41063608303S ; 

1MJM003E41413F41413E; 

1  - K "OQ3C-2-33C01-23C: 

1-LM 003 9 444436454639; 

i»»"0ClC2245495122lC; 

1"N" 00080808083800 0  6; 

1-0-007FO 10204331020; 

1-P" 00616204 0  8102343: 

1-CM00102C'»Q40'.02010  ; 

l-\" 000604081 

1  -  3-000402  3101010204: 

1" _ " 0000307F007F; 

l»~»  004141  FF4]  <•  3  5  5  2  2  : 
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APPENDIX    H 
IBM    2-EYTE    INTEFNAL    HANGUL    CODE 


1 1 0  I  _  3  4 


•j  0  0  0  u 

;,o  t 

.*  o  t 

id-  j 

7:  e  s  t  r  i . 

0  00  0  1 

Fill 

j  a  e  «  > 

i     ;-.es  t  r  •  '. 

o  o  o  i  a 

1 

nil        : 

■  001  1 

-n 

i 

)0  100 

'■- 

^ 

00  101 

c 

t 

001  10 

l£ 

i                    J 

001  1  1 

s 

i 

01000 

0  100  1 
01010 
0101  1 
01100 
01101 
Oil  10 
Oil  1  1 

10000 
10001 
10010 
1001  1 
10100 
10101 
101  10 
101  1  1 

1  1000 
11001 

I  10  10 
11011 
11100 

II  10  1 
11110 

11111 


a 

tftj 


■  "■  1 


■)    I 


No  t  U  sea  *  ,i 

Not  Usea  •  3 

Not  Used.  3 

No  t  Us  ea  •  3 

Not  Used* 3 

Not  Us  e>i  *3 

No  t  U  sea  *3 

No  t  usea«3 

hot  Used* 3 

Not  Used -2 
Rest  r icted  * i 


.-:e  st 
Rest 


ei«| 


4 


Restricted*  1 

Rest  ri  c  t  ea*  [• 

A 


■H\ 


Restr  ictea»| 
Restricted*! 

IT 

A 

I 
Not    U  s  e  d  *  3 
Net    Used* 3 


r, 
a 

d< 
a 


Not    Used  *  j 

Fiestnc  ted 
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